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Abstract This paper presents a novel decentralized 
control strategy for a multi-robot system that enables 
parallel multi-target exploration while ensuring a time- 
varying connected topology in cluttered 3D environ¬ 
ments. Flexible continuous connectivity is guaranteed 
by building upon a recent connectivity maintenance 
method, in which limited range, line-of-sight visibility, 
and collision avoidance are taken into account at the 
same time. Completeness of the decentralized multi¬ 
target exploration algorithm is guaranteed by dynami¬ 
cally assigning the robots with different motion behav¬ 
iors during the exploration task. One major group is 
subject to a suitable downscaling of the main traveling 
force based on the traveling efficiency of the current 
leader and the direction alignment between traveling 
and connectivity force. This supports the leader in al¬ 
ways reaching its current target and, on a larger time 
horizon, that the whole team realizes the overall task 
in finite time. Extensive Monte Carlo simulations with 
a group of several quadrotor UAVs show the scalability 
and effectiveness of the proposed method and experi¬ 
ments validate its practicability. 
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1 Introduction 


Success of multi-robot systems is based on their abil¬ 
ity of parallelizing the execution of several small tasks 
composing a larger complex mission such as, for in¬ 
stance, the inspection of a certain number of locations 
either generated off- or online during the robot motion 
(e.g., exploration, data collection, surveillance, large- 


scale medical supply or search and rescue (Howard et al 


2006[ Franchi et al|2009 Pasqualetti et al|2012[ Murphy 


et al|2008 Faigl and Hollingerj2014 |). In all these cases, 

a fundamental difference between a group of many sin¬ 
gle robots and a multi-robot system is the ability to 
communicate (either explicitly or implicitly) in order 
to then cooperate together towards a common objec¬ 
tive. Another distinctive characteristic in multi-robot 
systems is the absence of central planning units, as well 
as all-to-all communication infrastructures, leading to 
a decentralized approach for algorithmic design and im¬ 


plementations (Lynch 19971. While communication of 


a robot with every other robot in the group (via mul¬ 
tiple hops) would still be possible as long as the group 
stays connected, in a decentralized approach each robot 
is only assumed able to communicate with the robots 
in its 1-hop neighborhood (i.e., typically the ones spa¬ 
tially close by). This brings the advantage of scalability 
in communication and computation complexity when 
considering groups of many robots. 

The possibility for every robot to share informa¬ 
tion (via, possibly, multiple hops/iterations) with any 
other robot in the group is a basic requirement for typ¬ 
ical multi-robot algorithms and, as well-known, it is 
directly related to the connectivity of the underlying 
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graph modeling inter-robot interactions. Graph connec¬ 
tivity is a prerequisite to properly fuse the informa¬ 
tion collected by each robot, e.g., for mapping, local¬ 
ization, and for deciding the next actions to be taken. 
Additionally, many distributed algorithms like consen¬ 
sus (Olfati-Saber and Murray]|2004 1 and flooding (Lim 


and Kim|2001 1 require a connected graph for their suc¬ 


cessful convergence. Preserving graph connectivity dur¬ 
ing the robot motion is, thus, a fundamental require¬ 
ment; however, connectivity maintenance may not be a 
trivial task in many situations, e.g., because of limited 
capabilities of onboard sensing/communication devices 
which can be hindered by constraints such as occlusions 
or maximum range. Given the cardinal role of commu¬ 
nication for the successful operation of a multi-robot 
team, it is then not surprising that a substantial effort 
has been spent over the last years for devising strate¬ 
gies able to preserve graph connectivity despite con¬ 
straints in the inter-robot sensing/communication pos- 


sibilities, see 

, e.g., Antonelli et al ( 

20051; 

Stump et al 

(2008 

20111 

Pei and Mutka 

(20121 

Robuffo Giordano 


et al (20131. In general, fixed topology methods repre¬ 


sent conservative strategies that achieve connectivity 
maintenance by restraining any pairwise link of the in¬ 
teraction graph to be broken during the task execution. 
A different possibility is to aim for periodical connectiv¬ 
ity strategies, where each robot can remain separated 
from the group during some period of time for then re¬ 
joining when necessary. Continuous connectivity meth¬ 
ods instead try to obtain maximum flexibility (links can 
be continuously broken and restored unlike in the fixed 
topology cases) while preserving at any time the funda¬ 
mental ability for any two nodes in the group to share 
information via a (possibly multi-hop) path (unlike in 
periodical connectivity methods). 


With respect to this state-of-the-art, the problem 
tackled by this paper is the design of a multi-target ex¬ 
ploration/visiting strategy for a team of mobile robots 
in a cluttered environment able to i) allow visiting mul¬ 
tiple targets at once (for increasing the efficiency of the 
exploration), while ii) always guaranteeing connectiv¬ 
ity maintenance of the group despite some typical sens¬ 
ing/communication constraints representative of real- 
world situations. Hi) without requiring presence of cen¬ 
tral nodes or processing units (thus, developing a fully 
decentralized architecture), and iv) without requiring 
that all the targets are known at the beginning of the 
task (thus, considering online target generation). 

Designing a decentralized strategy that combines 
multi-target exploration and continuous connectivity 
maintenance is not trivial as these two goals impose 
often antithetical constraints. Several attempts have in¬ 
deed been presented in the previous literature: a fixed- 


topology and centralized method is presented in An- 


tonelli et al (20051, which, using a virtual chain of mo¬ 


bile antennas, is able to maintain the communication 
link between a ground station and a single mobile robot 
visiting a given sequence of target points. The method 


is further refined in Antonelli et al 

(20061. A similar 

problem is addressed in 

Stump et a 

(20081 by resort- 


ing to a partially centralized method where a linear 
programming problem is solved at every step of mo¬ 
tion in order to mix the derivative of the second small¬ 
est eigenvalue of a weighted Laplacian (also known as 
algebraic connectivity, or Fiedler eigenvalue) and the 
k-connectivity of the system. A line-of-sight communi¬ 


cation model is considered in Stump et al (20111, where 


a centralized approach, based on polygonal decomposi¬ 
tion of the known environment, is used to address the 
problem of deploying a group of roving robots while 
achieving periodical connectivity. The case of periodi¬ 
cal connectivity is also considered in Pasqualetti et al 


(20121 and Hollinger and Singh (20121. The first paper 


optimally solves the problem of patrolling a set of points 
to be visited as often as possible. The second presents 
a heuristic algorithm exploiting the concept of implicit 
coordination. Continuous connectivity between a group 
of robots exploring an unknown 2D environment and a 


single base station is considered in Pei et al (20101. The 


proposed exploration methodology, similar to the one 


presented in Franchi et al (20091, is integrated with a 


centralized algorithm running on the base station and 
solving a variant of the Steiner Minimum Tree Prob¬ 
lem. An extension of this approach to heterogeneous 


teams is presented in Pei and Mutka (20121. Zavlanos 


and Pappas (20071 exploit a potential field approach 


to keep the second smallest eigenvalue of the Laplacian 
positive. The method is tested with ground robots in 
an empty environment and assumes that each robot has 
access to the whole formation for computing the con¬ 
nectivity eigenvalue and the associated potential field. 
It is therefore not scalable, because the strength of all 
links has to be broadcasted to all robots in the group. 
Gontinuous connectivity achieved by suitable mission 


planning is described in Mosteo et al (20081, although 


this work does not allow for parallel exploration. An¬ 
other method providing flexible connectivity based on 
a spring-damper system, but not able to handle signif¬ 


icant obstacles, is reported in Tardioli et al (20101. 


A decentralized strategy addressing the problem of 
continuous connectivity maintenance for a multi-robot 


team is considered in Robuffo Giordano et al (20131. 


In this latter work, the introduction of a sensor-based 
weighted Laplacian allows to distributively and analyti¬ 
cally compute the anti-gradient of a generalized Fiedler 
eigenvalue. The connectivity maintenance action is fur- 
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ther embedded with additional constraints and require¬ 
ments such as inter-robot and obstacle collision avoid¬ 
ance, and a stability guarantee of the whole system, 
when perturbed by external control inputs for steering 
the whole formation, is also provided. Finally, apart 
for Robuffo Giordano et al (20131, all the previously 
mentioned continuous connectivity methods have only 
been applied to 2D-environment models. 


In this work, we leverage upon the general decentral¬ 
ized strategy for connectivity maintenance of |RobufFo| 


Giordano et al (20131 for proposing a solution to the 


aforementioned problem of decentralized multi-target 
exploration while coping with the (possibly opposing) 
constraints of continuous connectivity maintenance in 
a cluttered 3D environment. The main contributions of 
this paper and features of the proposed algorithm can 
then be summarized as follows: i) decentralized and 
continuous maintenance of connectivity, ii) guarantee 
of collision avoidance with obstacles and among robots. 
Hi) possibility to take into account non-trivial sens¬ 
ing/communication models, including maximum range 
and line-of-sight visibility in 3D, iv) stability of the 
overall multi-robot dynamical system, v) decentralized 
exploration capability, vi) possibility for more than one 
robot to visit different targets at the same time, vii) on¬ 
line path planning without the need for any (central¬ 
ized) pre-planning phase, viii) applicability to both 2D 
and 3D cluttered environments, and finally ix) com¬ 
pleteness of the multi-target exploration (i.e., all robots 
are guaranteed to reach all their targets in a finite time). 


The items i) - iv) have already been tackled in Robuffo 


Giordano et al (20131 and are here taken as a basis for 


our work. On the other hand, the combination of - 
iv) with the items v) - ix) is a novel contribution: to 
the best of our knowledge, our work is then the first 
attempt to propose a decentralized multi-target explo¬ 
ration algorithm possessing all the mentioned features 
altogether. 


The rest of the paper is organized as follows: Sec. 
provides a formal description of the problem under con¬ 
sideration. The proposed algorithm is then thoroughly 
illustrated in Sec. In Sec. we report the results 
of extensive Monte Garlo simulations and experiments 
with real quadrotors, and Sec. [^concludes the paper. 
In the Appendix, we recap the main features of the de¬ 
centralized continuous connectivity method presented 


Robuffo Giordano et al (20131 which is extensively 


exploited in this paper. We finally note that a prelim¬ 
inary version of our work has been presented in |Nest-| 
meyer et al ( 2013b|a l. 


2 System Model and Problem Setting 


We consider a group of N robots operating in a 3D 
obstacle-populated environment and denote with qi € 
the position of a reference point of the i-th robot, 
i = 1,..., A^, in an inertial world frame. We also let O 
be the set of obstacle points in the environment. Each 
robot i is assumed to be endowed with an omnidirec¬ 
tional sensor able to measure the relative position Qj—qi 
of another robot j provided that: 

1. \\qj — Qill < Rs, where > 0 is the maximum 
sensing range of the sensor, and 

2. min,-g[op],ogo ||(?i-kc(gj -qi)-o|| > i?o, i.e., the line 
segment connecting q^ to qj is at least at distance 
Ro > 0 away from any obstacle point. 


These two conditions account for two common charac¬ 
teristics of exteroceptive sensors, namely, presence of 
a limited sensing range Rs, and the need for a non- 
occluded line-of-sight visibilitjQ We further assume that 
if the *-th robot can measure qj — qi then it can also 
communicate with the j-th robot with negligible de¬ 
lays, that is, the sensing and communication graphs 
are taken coincident. This assumption is justified by 
the fact that communication typically relies on wireless 
technology, thus with a broader range than sensing and 
without the need for direct visibility to operate. The 
neighbors of the f-th robot are denoted with A/i(t), i.e., 
the (time-varying) set of robots whose relative position 
can be measured by the Tth robot at time t. 

Each robot i is also endowed with a sensor that mea¬ 
sures the relative position o — qi of every obstacle point 
o £ O such that ||o — qi\\ < Rm, where Rm > 0 is the 
maximum sensing range of this sensor. 

Gonsider the time-varying (undirected) interaction 
graph defined as G{t) = {V,£{t)), where V = {1,..., N} 
and £{t) = {{i,j) \j S J\fi{t)}. Preserving connectivity 
of Q{t) for all t, allows every robot to communicate at 
any time with any other robot in the network by means 
of a suitable multi-hop routing strategy, although due 
to efficiency and scalability reasons, it is always pre¬ 
ferred to use one-hop communication when possible. 

As previously stated, decentralized continuous con¬ 
nectivity maintenance is guaranteed by exploiting the 
method described in Robuffo Giordano et al (20131, 
which is based on a gradient-descent action that keeps 
positive the second smallest eigenvalue A 2 of the sensor- 
based weighted graph Laplacian ( Fiedler|1973[ | (see Ap¬ 
pendix]^ for a formal definition). 

Each f-th robot is finally endowed with a local mo¬ 
tion controller able to let qi track any arbitrary desired 


^ More complex sensing models could also be taken into ac¬ 
count, see [Robuffo Giordano et a for a discussion in this 

sense. 
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trajectory qi{t) with a sufficiently small tracking er¬ 
ror. This is again a well-justified assumption for almost 
all mobile robotic platforms of interest, and its valid¬ 
ity will also be supported by the experimental results 


of Sec. 4.2 Following the control framework introduced 


Robuffo Giordano et al (20131, the dynamics of qi is 


then modeled as the following second order system 


r; 


- /f - /^ = /, 

qi = 




( 1 ) 


where Vi G is the robot velocity. Mi G is its 

positive definite inertia matrix, and: 


1. = —BiVi G is the damping force (with Bi G 

1^3x3 a positive definite damping matrix) meant 

to represent both typical friction phenomena (e.g., 
wind/atmosphere drag in the case of aerial robots) 
and/or a stabilizing control action; 

2. G is the generalized connectivity force whose 
decentralized computation and properties are thor¬ 
oughly described in Robuffo Giordano et al (20131 
(a short recap is provided in Appendix IAD 

3. fi G is the traveling force used to actually steer 
the robot motion in order to execute the given task. 
An appropriate design of fi is the main goal of 
this work. As will be clear in the following, spe¬ 
cial care must be taken in the design of fi to avoid, 
for instance, deadlocks situations in which the robot 
group ‘gets stuck’. 


The following fact, shown in [Robuffo Giordano et al| 
(20131 and recalled in Appendix |a[ holds! 


Fact 1. As long as fi keeps bounded, the action of the 
generalized connectivity force fl' will always ensure ob¬ 
stacle and inter-robot collision avoidance and continu¬ 
ous connectivity maintenance for the graph Q(t) despite 
the various sensing/communication constraints (in the 
worst case, by completely dominating the bounded fi). 


To summarize, each robot has i) an accurate enough 
measurement of its own location, ii) an omnidirectional 
sensor which is able to measure relative positions of 
other robots and obstacles in its close proximity. Hi) neg¬ 
ligible (compared to the time scale of the robot mo¬ 
tion) communication delays with all robots that it can 
sense/communicate with, iv) the ability to accurately 
track a smooth path with a force controller. 


a continually adjustable list of targets that have 
to be visited by the robot in the presented order. We re¬ 
fer to this algorithm as the target generator of the i-th 
robot, and we also assume that the portion of the map 
needed to reach the next location from the current po¬ 
sition qi is known to robot i. The target generator may 
represent a large variety of algorithms, such as pursuit- 


evasion (Durham et al 20121, patrolling (Pasqualetti 


et al 20121, exploration/mapping (Franchi et al 


2009 


Burgard et al 20051, mobile-ad-hoc-networking (An- 
tonelli et al|2005 1, and active localization (Jensfelt and 


Kristensen||2001 1. It might be a cooperative algorithm. 


or each robot could have a target generator with objec¬ 
tives that are independent from the other target gener¬ 
ators. Another possibilty is to appoint a human super¬ 
visor as the target generator. 

Depending on the particular application, the loca¬ 
tions in the lists provided online by the target genera¬ 
tors may, e.g., represent: 


1. view-points from where to perform the sensorial ac¬ 
quisitions, 

2. coordinates of objects that have to be picked up or 
dropped down, 

3. positions of some base stations located in the envi¬ 
ronment. 

We formally denote with (z/,..., zf') g the 

list of mi locations provided by the i-th target gener¬ 
ator. Additionally, we consider the possibility, for the 
target generator, to specify a time duration At^ < oo 
for which the i-th robot is required to stay close to the 
point Zi, with k = 1,... ,mi. This quantity may repre¬ 
sent, with reference to the previous examples, the time 


1. needed to perform a full sensorial acquisition, 

2. necessary to pick up/drop down an object, 

3. required to upload/download some information from 
a base station. 


and can also possibly be adjusted at runtime during the 
execution of the respective task. 

Finally, we also introduce the concept of a cruise 
speed > 0 that should be maintained by the i-th 

robot during the transfer phase from a point to the next 
one. 

Given these modeling assumptions, the problem ad¬ 
dressed in this paper can be formulated as follows: 


2.1 Multi-target Exploration Problem 

We consider the broad class of problems in which each 
robot runs a black-boxed algorithm that produces on- 


^ By online we mean that the targets are generated at runtime, 
thus precluding the presence of a preliminary phase in which the 
robots may plan in advance the multi-target exploration action. 
Indeed, if all the targets are known beforehand, one could still 
apply our method but other planning strategies might potentially 
lead to better solutions. 
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Problem 1. Given a sequence of targets 
(presented online) for every robot i = to¬ 

gether with the corresponding sequence of time dura¬ 
tions At\,..., and a radius R^, design, for ev¬ 
ery i = a decentralized feedback control law 

fi (i.e., a function using only information locally and 
1-hop available to the i-th robot) for system Q which 
is bounded and such that, for the closed-loop trajectory 
fi,[o,t))> there exists a time sequence 0 <tl < ... < 
t^' < oo so that for all k = 1.. .mi, robot i remains 
for the duration Atf within a ball of radius Rz centered 
at zf, formally Vt S [t’(,t’( + At^] : \\q,{t) - zf\\ < Rz- 


3 Decentralized Algorithm 


In this section, we describe the proposed distributed 
algorithm aimed at generating a traveling force fi that 
solves Problem We note that the design of such an 
autonomous distributed algorithm requires special care: 
When added to the generalized connectivity force in 0. 
the traveling force fi should fully exploit the group ca¬ 
pabilities to concurrently visit the targets of all robots 
whenever possible and, at the same time, should not 
lead to Hocal minima) where the robots get stuck, due 
to the simultaneous presence of the hard connectivity 


constraint. While Robuffo Giordano et al (20131 already 
gave an exact description of ff^ and ff, an application 
of fi was kept open. The main focus of this work is 
to define fi in such a way that the above mentioned 
challenges are properly addressed. 

In order to provide an overview of the several vari¬ 
ables used in Secs. andwe included Tablefor the 
reader’s convenience. 


3.1 Notation and Algorithm Overview 

As in any distributed design, several instances of the 
proposed algorithms run separately on each robot and 
locally exchange information with the ‘neighboring’ in¬ 
stances via communication. Each instance is split into 
two concurrent routines: a planning algorithm and a 
motion control algorithm whose pseudocodes are given 
in Algorithmic and Algorithmic respectively. The plan¬ 
ning algorithm acts at a higher level and performs the 
following actions: 

— it processes the targets provided by the target gen¬ 
erator, 

— it generates the desired path to the current target, 
and 

— it selects an appropriate motion control behavior 
(see later). 


variable 

N 

Vi 

O 

Rs 

Ro 

Rc 

Mi 

Q 

^2 


J i 

J i 

h 

At>r 

Rz 

j.cruise 

7 i 


dl 

R-y 

aA 


{Xc,Xm) 

Ai 


Oi 


a 


Pi 


Table 1: Meaning of the variable names, 
meaning 

number of robots 
position of 2 -th robot 
velocity of 2 -th robot 
set of obstacle points 
maximum sensing range 
minimum distance to obstacle 
minimum inter-robot distance 
neighbors of 2 -th robot 
interaction graph 

second smallest eigenvalue of the sensor-based 

weighted graph Laplacian 

generalized connectivity force 

damping force 

traveling force 

/c-th target of 2 -th robot 

amount of time to stay close to target 

maximum distance to target when anchored 

maximum cruise speed 

path to current target, starting from position of 
robot at time of computation 
closest point of path from current position 
length of remaining path 

distance to path at which it should be re-planned 
weighting of position vs. velocity error 
absolute tracking error of 2 -th robot along path 
tracking error bounds for the traveling efficiency 
traveling efficiency of 2 -th robot (i.e., tracking er¬ 
ror nonlinearly scaled to [0,1] based on aic, xm) 
estimation of the traveling efficiency of the ‘prime 
traveler’ by the 2 -th robot 

force direction alignment between connectivity 
and traveling force of 2 -th robot 
weighting between the force direction alignment 
and the ‘prime traveler’ traveling efficiency 
downscaling factor of a ‘secondary traveler’, de¬ 
pendent on Ap, &i and a 


The motion control algorithm acts at a lower level by 
specifying the traveling force fy as a function of the 
behavior and the planned path selected by the planning 
algorithu 0 

The two algorithms have access to the same vari¬ 
ables which are formally introduced as follows (see Fig.[^ 
for a graphical representation of some of these vari¬ 
ables): the variable targetQueuei is filled online by the 
target generator and contains a list of future targets to 
be visited by the Tth robot. During the overall running 
time of the algorithms, the target generator of robot i 
has access to the whole list targetQueuei (which can also 
be changed online if needed). The current target for the 
i-th robot (i.e., the last target extracted from the first 
entry in targetQueue^) is denoted with Zi. Variable 7 ^ is 
a geometric path that leads from the current position 
Qi of the i-th robot to the target Zi. In our implementa- 

^ The two routines can run at two different frequencies, typi¬ 
cally slower for the planning loop and faster for the motion control 
loop. 
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Fig. 1: Position qi and path 7 ^ followed by a traveler from the 
point to the current target Zi. The solid part of the path 
represents the remaining path which starts at the closest point 
on the path q'^ and whose length is denoted by . 


tion, we used B-splines ( Biagiotti and Melchiorri||2008 1 
in order to get a parameterized smooth path, but any 
other path would be appropriate. If the robot is not 
traveling towards any target, then 7 ^ is set to null. 

With reference to Fig. we also denote with qj the 
closest point of the path 7 ^ to qi, i.e., the solution of 
argminpg^. \\p — qi\\. In case of multiple solutions, we 
choose the one with the largest arc-length, i.e., the one 
nearest to the target along the path. Therefore, the 
closest point q] can be considered as unique in the fol¬ 
lowing. The portion of the path 7 ^ from qj to Zi is 
referred to as the remaining path, and its length is de¬ 
noted with dj. 

The motion behavior of the i-th robot is determined 
by the variable states that can take four possible values: 


— connector 

— prime-traveler 

— secondary-traveler 

— cinchor. 


The following provides a qualitative illustration of these 
motion behaviors, while a functional description is given 
in the next sections: 


• ‘connector’: a robot in this state is not assigned any 
target by the target generator and therefore, its only 
goal is to help keeping the graph Q connected. For 
this reason fi is set to zero and hence the robot is 
subject solely to the damping and generalized con¬ 
nectivity force //* in ([^; 

• ‘prime traveler’: a robot in this state travels towards 
its current target Zi along the path 7 ^ thanks to the 
force fi- At the same time, the robot distributively 
broadcasts to every other robot a non-negative real 
number, denoted with Ai, that represents its travel¬ 
ing efficiency, i.e., a measure how well it is able to 
follow its desired path while being influenced by the 
other robots in the group via the generalized con¬ 
nectivity force fl' (which is described in more detail 
later). It is essential for the algorithm that only one 
‘prime traveler’ exists in the group at any time. Ev¬ 
ery other robot with an assigned target needs to be a 


‘secondary traveler’ or ‘anchor’. This feature will al¬ 
low one robot (the ‘prime traveler’) to reach its target 
with a high priority, while the other robots will only 
be allowed to reach their own targets as long as this 
action does not hinder the ‘prime traveler’ goal. 

• ‘secondary traveler’: a robot in this state travels to¬ 
wards its current target zi along the path 7 ^ thanks to 
the force fi. The robot keeps an internal estimation 
Ap of the traveling efficiency of the current ‘prime 
traveler’, and it scales down the intensity of its trav¬ 
eling force fi by an adaptive gain pi whenever the ac¬ 
tion of fi is ‘too conflicting’ w.r.t. that of f^, or the 
‘prime traveler’ A^ drops lower than a given thresh¬ 
old. 

• ‘anchor’: a robot in this state has reached the prox¬ 
imity of the target Zi. The force fi is then exploited 
in order to keep qi within a circle of radius Rz cen¬ 
tered at Zi (i.e., the robot is ‘anchored’ to the target), 
while waiting for the associated time Ati to elapse. 

In order to obtain a better intuition of the roles of 
the robots, we suggest the reader to watch the “Empty 
Space” video available in the attached multimedia ma- 
teria 0 

To summarize this qualitative description, these be¬ 
haviors are designed in such a way that the single ‘prime 
traveler’ approaches its target with the highest priority, 
the ‘secondary travelers’ approach their targets as long 
as they have enough spatial freedom by the generalized 
connectivity force, the ‘anchors’ stay close to the target 
until their task is completed, and the ‘connectors’ help 
the ‘secondary travelers’ in providing as much spatial 
freedom as possible while preserving the connectivity of 
the graph. 

Whenever a robot moves, it may indirectly exert a 
certain generalized connectivity force on all its neigh¬ 
bors because of the properties of //' (i.e., for retaining 
generalized connectivity of the graph Q (see 
and Appendix [A|) . This 
tivity action can possibly conflict with the traveling 
force fi, and also prevent, in the worst case, fulfilment of 
the multi-target exploration task (e.g., the group falls 
in a local minimum because two robots start travel¬ 
ing in opposite directions over too large distances, thus 
threatening connectivity maintenance). 

Since the ‘connectors’ implement /^ = 0 by defini¬ 
tion, they cannot directly hinder the ‘prime traveler’ 
motion. In other words, a group made by all ‘connec¬ 
tors’ and one ‘prime traveler’ would always allow the 
‘prime traveler’ to reach its target. Presence of ‘an¬ 
chors’ can instead block the ‘prime traveler’ because of 

^ http://homepages.laas.fr/afranchi/videos/multi_exp_ 
conn.html 


Giordano et al (2013 


Robuffo 

connec- 
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Procedure Start-up for Robot i 


1 

2 

3 

4 

5 

6 
r 

s 

g 

10 

11 


if targetQueuSi is empty then 
^ null 

statCj ^ connector 
else 

Extract first target from targetQueuej and save it as Zi 
^ Shortest obstacle-free path from qi to zi 
Enroll in the list of Candidates to take part in the first 
distributed ‘prime traveler’ election 

if i = 3.\:gniiaj^Candidates<^ 

states <— prime-traveler 
else 

7tatei <— secondary-traveler 


12 Ap <— 0 

13 Run Algorithm 1 and Algorithm 2 in parallel 


the anchoring force which prevents them to move away 
from their targets. Nevertheless the anchoring phase 
can only last for a finite time after which the ‘an¬ 
chor’ changes state and is again free to move. 

No such mechanism is instead present for the ‘sec¬ 
ondary travelers’ which would constantly attempt to 
move along their paths with a pi set to 1. As explained, 
if many robots are simultaneously traveling in arbitrary 
directions inside a cluttered environment, while also 
maintaining connectivity of Q, the overall group motion 
can potentially (and quite easily) fall into a local mini¬ 
mum. The idea behind the gain pi is to then adaptively 
scale down the traveling force fi of the ‘secondary trav¬ 
elers’ whenever either (i) the direction fi deviates too 
much from the connectivity force /7 or (ii) the ‘prime 
traveler’ motion is nevertheless too obstructed by the 
actions of the other ‘secondary travelers’ in the group. 
Consequently, this gain pi S [0,1] is chosen so that the 
current ‘prime traveler’ can always reach its target, no 
matter the motion planned by the ‘secondary travelers’ 
in the group. A formal description of this concept will 


be given in Sec. 3.7 


3.2 Start-up phase 

The Procedure jStart-up for Robot i\ performs the dis¬ 
tributed initialization of the planning and motion con¬ 
trol algorithms. Its pseudocode is quickly commented 
in the following. 

At the beginning, if targetQueuei is empty then the 
path 7 i is set to null and states to connector (line[^. 
Otherwise the first target from targetOueue^ is 
extracted and saved in Zi. Then, the robot i computes 
a Cf shortest and obstacle-free path 7 ^ that connects 
its current position qi with Zi (line [^. This path is 
generated with a two-step optimization method: first. 


the known portion of the map is discretized into an 
equally spaced grid in 3D with a cell size of Rgnd- A 
cell is marked as occupied whenever an obstacle lies 
inside a radius of Rgrid around the cell. On this grid, 
a shortest path is found via A*. Then, the waypoints 


obtained from A* are approximated with a B-spline (Bi- 


agiotti and Melchiorri|2008 1 in order to remove corners 


from the path. We note that, depending on the smooth¬ 
ing parameter, this approximation is not guaranteed to 
leave enough clearance from surrounding obstacles. Ob¬ 
stacle avoidance is nevertheless ensured thanks to the 
presence of the generalized connectivity force that pre¬ 
vents any possible collisions by (possibly) locally ad¬ 
justing the planned path when needed. As an alterna¬ 
tive, one could also rely on the method proposed in|Ma-| 


sone et al (20121 for directly generating a smooth path 


with enough clearance from obstacles. 

Subsequently, the robot takes part in the distributed 
election of the first ‘prime traveler’ (see Sec. 3.31. De¬ 
pending on the outcome of this election, states is set 
either to prime-traveler or secondcury-traveler. 

At the end of the initialization procedure, the esti¬ 
mate Ap of the traveling efficiency of the current ‘prime 
traveler’ is initialized to zero (line[I^ for all robots, and 
the planning and motion control algorithms are both 
started (line 131. 


3.3 Election of the ‘prime traveler’ 


In a general election of a new ‘prime traveler’, the cur¬ 
rent ‘prime traveler’ triggers the election process (line[l4| 
of Algorithm [^, to which every ‘secondary traveler’ 
replies with its index and remaining path length, in 
order to be taken into the list of candidates (line . 
Since this election is a low-frequency event, we chose to 
implement it via a simple flooding algorithm (|Lim and] 


Kim 2001 1 . Although this solution complies with the 


requirement of being decentralized, one could also re¬ 


sort to‘smarter’ distributed techniques such as (Lynch 


19971. The ‘prime traveler’ then waits for 2(N —1) steps 


to collect these replies, being 2{N — 1) the maximum 
number of steps needed to reach every robot with flood¬ 
ing and obtain a reply. The winner of this election is 
then the robot with the shortest remaining path length 


d], i.e., the robot solving argminjgcandidates fij• In the 
unlikely event of two (or more) robots having exactly 
the same remaining path length, the one with the lower 
index is elected. During the whole election process, the 
‘prime traveler’ keeps its role and only upon decision 
it abdicates by switching into the ‘anchor’ state. After 
announcing the winner, no ‘prime traveler’ exists in the 
short time interval (at most N — 1 steps) until the an¬ 
nouncement reaches the winning ‘secondary traveler’. 
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Algorithm 1: Planning for Robot i 

1 while 

true do 

2 

switch statei do 

3 


case connector do 

4 



if targetQueuei is not empty then 

5 




Extract the next target from targetQueuei and 





save it as Zi 

6 




7 i 4r- Shortest obstacle-free path from qi to Zi 

r 




if There is no ‘prime traveler’ in the group then 

8 




statei ■<— prime-traveler 

g 




else 

10 




^statei -^secondary-traveler 

11 



case prime-traveler do 

12 



if \\qi — Zi\\ < Rz then 

13 




7 i -(r- null 

14 




Permit ‘prime traveler’ candidacy within timeout 

15 




statei cuichor 

16 


case secondary-traveler do 

ir 



if \\qi — q2\\ > R'y then 

18 




7 i ■<— Shortest obstacle-free path from qi to Zi 

19 



if ||(ji — Zi\\ < Rz then 

20 




7 i 4r- null 

21 




statei cuichor 

22 



else if ‘prime traveler’ candidacy is allowed then 

23 




Enroll in the list of Candidates to take part in 





the distributed ‘prime traveler’ election 

24 




if i = arg min^gcandidates 7 then 

25 




^statei prime-traveler 

26 



case anchor do 

27 



if task at target Zi is completed then 

28 




statei connector 







This winning robot then switches into the ‘prime trav¬ 
eler’ behavior. This mechanism makes sure that at most 
one ‘prime traveler’ exists at any given time. 


The first election in the Start-up phase (see Sec. 3.2 


and line in Procedure ' Start-up for Robot i ) is han¬ 
dled slightly differently. Instead of the current ‘prime 
traveler’ organizing the election, robot 1 is always as¬ 
signed the role of host and, instead of the only ‘sec¬ 
ondary travelers’ replying, every robot with an assigned 
target replies with its index and remaining path length 
(including robot 1 if it has an assigned target). 


3.4 Planning Algorithm 

In this section, we describe in detail the execution of 
Algorithm running on the i-th robot, whose logical 
flow is provided in Figure as a graphical representa¬ 
tion. The algorithm consists of a continuous loop where 
different decisions are taken according to the value of 



states and according to the following different behav¬ 
iors: 

3.4-1 case connector 

If states is set to connector then targetQueue^ is checked. 
In case of an empty queue, state.^ remains connector, 
otherwise the next target is extracted from the queue 
and saved in Zi (line [^. Then the i-th. robot com¬ 
putes a shortest and obstacle-free path 7 ^ connect¬ 
ing the current robot position qi with Zi (line im¬ 
plementing what was previously described in the start¬ 
up procedure. Finally, the robot changes the value of 
state^ in order to track 7 ^. In particular, if no ‘prime 
traveler’ is present in the group, then states is set to 
prime-traveler (line |^. Otherwise, states is set to 
secondcury-traveler (line |lO|Pj 


3.4.2 case prime-traveler 


When states is set to prime-traveler (line 11) and 
the current position qi is closer than Rz to the target 


Zi (line 12 ), the following actions are performed: 


— the path 7 ^ is reset to null (line [T^ , 

— a new distributed ‘prime traveler’ election as de- 
announced (line[l4|. 


scribed in Sec. 3.3 


IS 


— the robot abdicates the role of ‘prime traveler’ and 
states is set to cuichor (line [T^ . 


If, otherwise, Zi is still far from the current robot po¬ 
sition qi, then state^ remains unchanged and the robot 
continues to travel towards its target. 


® Presence of a ‘prime traveler’ can be easily assessed in a dis¬ 
tributed way by, e.g., flooding l |Lim and Kim||2001| l on a low 
frequency. 
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3.4-3 case secondary-traveler 


When states is secondary-traveler (line 16 1 the dis¬ 
tance ||(?7 ~ 9*11 (closest point on the) path is 


checked (line 171. If this distance is larger than the 
threshold R-y, the robot replans a path from its current 
position Qi (line [I^ . This re-planning phase is neces¬ 
sary since a ‘secondary traveler’ could be arbitrarily far 
from the previously planned "ft because of the ‘dragging 
action’ of the current ‘prime traveler’. Section |3.6| will 
elaborate more on this point. Subsequently, if qi is closer 
than Rz to the target Zi (line[T^, the path 7 ^ is reset to 
null (line [20| and states is set to anchor (line 21 1 . Oth¬ 


erwise, if the target is still far away, the robot checks 
whether the ‘prime traveler’ abdicated and announced 
an election of a new ‘prime traveler ’ (line[^. If this was 
the case, the robot takes part in the election (line 231 


as described in Sec. 13.31 If the robot wins the election 


(line 241, state^ is set to prime-traveler (line 25) oth¬ 
erwise it remains set to secondary-traveler. 


3 . 4.4 case anchor 

The last case of Algorithm [l] is when states is anchor 
(line [ 2 ^ . The robot remains in this state until the task 
at target Zi is completed (line [27|) , after which state^ is 
set to connector. 


3.5 Completeness of the Planning Algorithm 

Before illustrating the motion control algorithm, we state 
some important properties that hold during the whole 
execution of the planning algorithm. 

Proposition 1 If there exists at least one target in one 
of the targetQueuei, then exactly one ‘prime traveler’ 
will be elected at the beginning of the operation. Fur¬ 
thermore, this ‘prime traveler’ will keep its state until 
being closer than Rz to its assigned target. In the mean¬ 
time no other robot can become ‘prime traveler’. 

Proof. The start-up procedure guarantees that, if there 
exists at least one target in at least one of the tar¬ 
getQueuei, the group of robots includes exactly one 
‘prime traveler’ and no ‘anchor’ at the beginning of the 
task. Any other robot is either a ‘connector’ or ‘sec¬ 
ondary traveler’ depending on the corresponding avail¬ 
ability of targets. During the execution of Algorithm [l] 
a robot can only switch into ‘prime traveler’ when be¬ 
ing a ‘connector’ or a ‘secondary traveler’. As long as 
there exists a ‘prime traveler’ in the group, a ‘connec¬ 
tor’ cannot become a ‘prime traveler’. Furthermore, a 
‘secondary traveler’ becomes a ‘prime traveler’ only if 


it wins the election announced by the ‘prime traveler’. 
Since the ‘prime traveler’ allows for this election only 
when in the vicinity of its target (within the radius Rz), 
the claim directly follows. □ 

Using this result, the following proposition shows 
that Algorithmj^is actually guaranteed to complete the 
multi-target exploration in the following sense: when 
presented with a finite amount of targets, all targets of 
all robots are guaranteed to be visited in a finite amount 
of time. In order to show this result, an assumption on 
the robot motion controller is needed. 

Assumption 1. In a group of robots with exactly one 
‘prime traveler’, the adopted motion controller is such 
that the ‘prime traveler’ is able to arrive closer than Rz 
to its target in a finite amount of time regardless of the 
location of the targets assigned to the other robots. 

In Sec. 13.71 we discuss in detail how the motion con¬ 
troller introduced in the next section [T 6 | meets Assump¬ 
tion [ 1 ] 

Proposition 2 Given a finite number of targets and 
a motion controller fulfilling Assumption [7| the whole 
multi-target exploration task is completed in a finite 
amount of time as long as the local tasks at every target 
can be completed in finite time. 

Proof. In the trivial case of no targets, the multi-target 
exploration task is immediately completed. Let us then 
assume presence of at least one target. Proposition 
guarantees existence of exactly one ‘prime traveler’ at 
the beginning of the planning algorithm, and that such 
a ‘prime traveler’ will keep its role until reaching its tar¬ 
get, an event that, by virtue of Assumption happens 
in finite time. At this point, assuming as a worst case 
that no ‘secondary traveler’ has reached and cleared its 
own target in the meantime, one of the following situ¬ 
ations may arise: 

1. There is at least one ‘secondary traveler’. The ‘sec¬ 
ondary traveler’ closest to its target becomes the 
new ‘prime traveler’ in the triggered election, and 
it then starts traveling towards its newly assigned 
target until reaching it in finite time (Proposition 
and Assumption 

2. There is no ‘secondary traveler’ and no other ‘an¬ 
chor’ besides the former ‘prime traveler’. In this 
case, no other robot has targets in its queue as, oth¬ 
erwise, at least one ‘secondary traveler’ would exist. 
Therefore, after completing its task at the target 
location (a finite duration), the former ‘prime trav¬ 
eler’ and now ‘anchor’ becomes ‘connector’ and, in 
case of additional targets present for this robot, it 
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Algorithm 2: Motion Control for Robot i 

1 while true do 

2 

switch statei do 

3 


case connector do 

4 



Update yij, using dj, = “ ^p) 

5 



_f^^0 

6 


case “prime-traveler do 

r 



Ap <— Ai using l|^ 

8 



_/i <- /travel(gi,7i,v““=®), Using ||^ 

g 


case secondary-traveler do 

10 



Update A^ using A^ = i^P ~ ^p) 

11 



/it- Pi/travel(ft,7i,'!ii‘''“®®), Using ^ and ([^ 

12 


case anchor do 

13 



Update Alp using 7^ = k^ YjsAfi (^P ~ ^7) 

14 


- 

_/i t- /anchor(gi,^i,-Rz), as per 1 ^ 


switches back into being a ‘prime traveler’ and trav¬ 
els towards the new targets in a finite amount of 
time as in case 1 . 

3. There is no ‘secondary traveler’, but at least one 
other ‘anchor’. This situation can be split again into 
two sub-cases: 

(a) there exists at least one ‘anchor’ with a future 
target in its queue. Then, after a finite time, this 
‘anchor’ becomes ‘secondary traveler’ and case 1 
holds; 

(b) there is no ‘anchor’ with a future target in its 
queue. Then, after a finite time, all ‘anchors’ 
have completed their local tasks and case 2 holds. 

In all cases, therefore, one target is visited in finite 
time by the current ‘prime traveler’. Repeating this loop 
finitely many times, for all the (finite number of) tar¬ 
gets, allows to conclude that all targets will be visited 
in a finite amount of time, thus showing the complete¬ 
ness of the planning algorithm. 

If a ‘secondary traveler’ already reaches its target 
while the ‘prime traveler’ is active, the aforementioned 
worst case assumption is not valid anymore. But since, 
in this case, the target of the ‘secondary traveler’ is 
already cleared, the total number of iterations is even 
smaller than in the previous worst case, thus still re¬ 
sulting in a finite completion time. □ 


3.6 Motion Control Algorithm 


before, in which the force fi is computed according to 
the behavior encoded in the variable states determined 
by Algorithm [2 

3.6.1 case connector 

If state^ is set to connector, the estimate of the 
traveling efficiency of the current ‘prime traveler’ is up¬ 
dated with a consensus-like algorithm (line that will 
be described in the next Sec. |3.7| The traveling force 
fi is in this case simply set to 0 (line |^ . It is worth 
mentioning that fi = 0 does not mean the f-th robot 
will not move, since a ‘connector’ is still dragged by the 
other travelers via the generalized connectivity force 
(according to Q, the ‘connectors’ are still subject to 
and /f). 


3.6.2 case prime-traveler 


If states is set to prime-traveler, the estimate is 
set to the true traveling efficiency Ai, defined by ([^ 
(line 1 ^ . Afterwards (line[^ the robot sets 

fi = /travel (?*, 7i, , (2) 


where /travel (?*, 7i, e is a proportional, deriva¬ 

tive and feedforward controller meant to travel along 7 ^ 
at a given cruise speed 


/travel(ft I 7ii 


^) = a7(u~,g7) 

+ fc„(u7(u~,97)-ft) 

+ - li)- 


( 3 ) 


Here, kp and ky are positive gains, qf is the point on 7 ^ 
closest to qi (see Fig. [^, u7(u“'“'®®, g/) is the velocity 
vector of a virtual point traveling along 7 ^ and passing 
at qf with tangential speed u®™'®®, and aj, q]) is 
the acceleration vector of the same point. It is straight¬ 
forward to analytically compute both the velocity and 
the acceleration from u®’'®”®®, given the spline represen¬ 
tation of the curve (Biagiotti and Melchiorri||2008 1 . 


3.6.3 case secondary-traveler 

If states is set to secondairy-traveler, the esti mat e 
At is updated with a consensus-like protocol (line 


Then (line 11 \ the robot sets 
fi = Pr/travel(ft,7r,l^i™®), 


lOl. 


( 4 ) 


With reference to Algorithm we now describe the 
motion control algorithm that runs in parallel to the 
planning algorithm on the i-th robot, and whose goal is 
to determine a traveling force fi that can meet Assump¬ 
tion [2 The algorithm consists of a continuous loop, as 


where /travel is defined as in (i and Pi G [ 0 , 1 ] is an 
adaptive gain meant to scale down the intensity of the 
action of ftravel(qi,7i, u®''®“®®) whenever fij its alignment 
is too conflicting with the generalized connectivity force 
fi or (a) the ‘prime traveler’ is not able to efficiently 
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Fig. 3: Shape of the function defined in § that is 0 on 

the target Zi itself [£.i = 0) and grows unbounded at the border 
of a sphere with radius Rz. 


travel along its path because its reached speed is too 
low compared to its desired cruise speed. Section [3^ is 
dedicated to provide details on choosing an effective pi. 


3 . 6.4 case anchor 


If states is set to cinchor, the estimate is again up¬ 
dated using a consensus-like protocol (line 131. Then 
(line 141 the force fi is set as 


Fig. 4: Sketch of the function A(x,Xc,xm} for fixed Xc and xm- 


and the generalized connectivity force is too differ¬ 
ent, or (ii) the traveling efficiency of the ‘prime traveler’ 
is too low. 

Therefore the design of pi aims at guaranteeing that 
the current ‘prime traveler’ can always reach its target, 
whatever the motion planned by the other robots in the 
group are, and thus in fact enforces Assumption [l] 

We recall that we provide a compendium of all im¬ 
portant variables in Table 

In order to implement the desired behavior we in¬ 
troduce two functions: 


fi — fanchoriQij — 


dqi 


0 : X ^ [0,1] 

A:R+ xK* ^ [0,1] 


where : [0,Rz) -A [0,oo) is a monotonically in¬ 

creasing potential function of the distance £i = ||gi — Zi|| 
between the robot position qi and the target Zi, and 
such that = 0 and lim^./.fl, 

Under the action of /anchor(ft; U, .^z) the position qi 
is then guaranteed to remain confined within a sphere 
of radius centered at Zi until the local task at the 
target location is completed. In our simulations and ex¬ 
periments we employed 

Knchor(^*) = -kz^-^ In (cos (|^)) (6) 


where kz is an arbitrary positive constant. The shape 
of this function is shown in Fig. and the associated 


/anchor 1® 


/anchor(ft, ft, Rz^ — kz tan 


iiir 



( 7 ) 


3.7 Traveling Efficiency, Force Alignment and 
Adaptive Gain 

We now describe how the estimation of the traveling 
efficiency Ai of all robots and the adaptive gain pi of a 
‘secondary traveler’, used in Algorithm]^ are actually 
computed. Remember that the idea behind the gain pi 
is to adaptively scale down the traveling force fi of a 
‘secondary traveler’ whenever (i) the alignment of fi 


as: 


0(x,y) = 


G K2|o 

< Xc < Xm}, 

defined 

x'^y \ 

M\\\y\\) 

X 0,2/ 7^ 0 

otherwise 

(8) 


X G [0,Xc] 

,) (9) 

2 

^ X G (Xc,Xa 


X e [xm, oo). 


Function 0{x,y) represents a ‘measure’ of the direction 
alignment of the two non-zero 3D vectors x and y. In 
particular, &(x,y) is 1 if x and y are parallel with the 
same direction, ^ if they are orthogonal, and 0 if they 
are parallel with opposite direction. Note that 0(x, y) is 
equivalent to ^ (l-kcos 9) with 9 being the angle between 
vectors x and y. 

Function A{x,Xc,xm) ‘measures’ how small x is. If 
X < Xc then x is considered ‘small enough’ and, there¬ 
fore, A = 1. If X € (xc,xm) then A strictly monoton¬ 
ically varies from 1 to 0. If x > xm, then A = 0. The 
shape of A is depicted in Fig. 

Having introduced these functions, we now define 
the force direction alignment of the i-th. robot as 

0. = 0{ft /travel(ft,7*,F~)), (10) 


and note that &i can be locally computed by the i-th 
robot. The quantity Oi thus represents an index in [0,1] 
measuring the degree of conflict among the directions 
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of the generalized connectivity force and the traveling 
force. 

When ^ null, we also define the absolute track¬ 
ing error as 

Ci = (1 -aA)||T7(ur“"®,q7) + aAhJ “ Qi\\, (H) 

with a A G [0,1] being a constant parameter modulating 
the importance of the velocity tracking error w.r.t. the 
position tracking error. The traveling efficiency is then 
defined as 

Ai = A{ei,Xc,XM), ( 12 ) 

where 0 < Xc < xm < oo are two user-defined thresh¬ 
olds representing the point at which the traveling ef¬ 
ficiency Ai starts to decrease and the maximum toler¬ 
ated error after which the traveling efficiency vanishes. 
In this way it is possible to evaluate how well a trav¬ 
eler can follow its desired planned path according to a 
suitable combination of velocity and position accuracy. 
It is important to note that the value Ai = \ does not 
imply an exact tracking of the path, but it still allows 
a small tracking tolerance (dependent on the param¬ 
eter Xc)- Similarly, the value = 0 does not imply 
a complete loss of path tracking, but it represents the 
possibility of a tracking error higher than a maximum 
threshold (dependent on xm)- 

In order to meet Assumption we are only inter¬ 
ested in the traveling efficiency of the current ‘prime 
traveler’ for monitoring whether (and how much) its 
exploration task is held back by the presence/motion 
of the ‘secondary travelers’. From now on we then de¬ 
note this value as Ap, where 






Fig. 5: Function p for ct = 2 (left side) and cr = 6 (right side). 
The motion controller exploits this function by plugging the force 
direction alignment in the x argument, and the estimate of the 
traveling efficiency of the current ‘prime traveler’ in the y argu¬ 
ment. 


p = i s.t. state^ = prime-traveler. 

This quantity is not in general locally available to 
every robot in the group, and therefore a simple decen¬ 
tralized algorithm is used for its propagation to avoid a 
flooding step. Among many possible choices we opted 
for using the following well-known consensus-based prop¬ 
agation ( |01fati-Saber and Murray]|2003 1: 

(13) 

Al = Ai if i = p. 


a PI average consensus estimator ( Freeman et al|[2006 1 
to cope with presence of a time-varying signal. How¬ 
ever, for simplicity we relied on a simple consensus law 
with less parameters to be tuned, and with, neverthe¬ 
less, a satisfying performance as extensively shown in 
our simulation and experimental results. 

Hence, every ‘secondary traveler’ can locally com¬ 
pute 0i and build an estimation Ap of Ap. In order to 
consolidate these two quantities into a single value, we 
define the function p : [0,1] x [0,1] x [1, oo) —)■ [0,1] as: 

Pix, y, a) = {l-x)y'^ +x{l-{l- y)'^), (14) 


This distributed estimator lets Ap track Ap for all i 
that hold state^ yf prime-traveler with an accuracy 
depending on the chosen gain Pa- Notice that, for a 
constant Ap, the convergence of this estimation scheme 
is exact. Furthermore, since Ap G [0,1], A^ is then sat¬ 
urated so as to remain in the allowed interval despite 
the possible transient oscillations of the estimator. In¬ 
stead of this simple consensus, one could also resort to 


where 1 < cr < cxd is a constant parameter. Gain pi is 
then obtained from and Ap as 

p^ = p(0^,Ap,<T) (15) 

with 1 < cr < (X) being a tunable parameter. 

The reasons motivating this design of gain pi are as 
follows: Pi is a smooth function of 0i and Ap possessing 
the following desired properties (see also Fig. 
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1. yip = 1 Pi = 1: if the traveling efficiency of the 
‘prime traveler’ is 1 then every ‘secondary traveler’ 
sets /, = /travel(gi,7o'y*™®); 

2. yip = 0 Pi = 0: if the traveling efficiency of the 
‘prime traveler’ is 0 then every ‘secondary traveler’ 
sets /i = 0; 

3. Pi monotonically increases w.r.t. yip for any 0i and 
a in their domains; 

4. Pi constantly increases w.r.t. 0i for any yip G (0,1) 
and cr > 1; 

5. if cr = 1 then pi = yip for any 0i G [0,1] 

6. if cr —)• 00 then pi —>• 0i for any yip G (0,1). 

Summarizing, gain pi mixes the information of both 
the force direction alignment and the traveling efficiency 
of the ‘prime traveler’, with more emphasis on the first 
or the second term depending on the value of the pa¬ 
rameter cr. Nevertheless, the traveling efficiency yip is 
always predominant at its boundary values (0 and 1) 
regardless of the value of cr. This means that, when¬ 
ever the estimated travel efficiency of the ‘prime trav¬ 
eler’ is yip = 0 and robot i is a ‘secondary traveler’, its 
traveling force is scaled to zero and, therefore, robot i 
only becomes subject to the connectivity and damping 
force. Therefore, in this situation the motion of all ‘sec¬ 
ondary travelers’ results dominated by the ‘prime trav¬ 
eler’, which is then able to execute its planned path 
towards its target location. On the other hand, when 
/Ip = 1, the ‘prime traveler’ has a sufficiently high trav¬ 
eling efficiency despite the ‘secondary traveler’ motions. 
Therefore, every ‘secondary traveler’ is free to travel 
along its own planned path regardless of the direction 
alignment between traveling and connectivity force. 

We conclude noting that the main goal of the ma¬ 
chinery defined in Sec. |3.6| -Sec. |3.7| is to ensure that 
the motion controller meets the requirements defined 
in Assumption Although some of the steps involved 
in the design of the traveling force fi have a ‘heuristic’ 
nature, the proposed algorithm is quite effective in solv¬ 
ing the multi-target exploration task (in a decentralized 
way) under the constraint of connectivity maintenance, 
as proven by the several simulation and experimental 
results reported in the next section. 


All the simulation (and experimental) results were 
run in 3D environments, although only a 2D perspec¬ 
tive is reported in the videos for the simulated cases 
(therefore, robots that may look as ‘colliding’ are ac¬ 
tually flying at different heights, since their generalized 
connectivity force prevents any possible inter-robot col¬ 
lision) . 


As robotic platform in both simulations and ex¬ 
periments we used small quadrotor UAVs (Unmanned 
Aerial Vehicles) with a diameter of 0.5 m. This choice 
is motivated by the versatility and construction sim¬ 
plicity of these platforms, and also because of the good 
match with our assumption of being able to track any 
sufficiently smooth linear trajectory in 3D space. 


We further made use of the SwarmSimX environ¬ 


ment (Lachele et al 20121, a physically-realistic simu¬ 
lation software. The simulated quadrotors are highly 
detailed models of the real quadrotors later employed 
in the experiments. The physical behavior of the robots 
itself and their interaction with the environment is sim¬ 
ulated in real-time using Phys!X|^ 


For the experiments, we opted for a highly cus¬ 
tomized version of the MK-QuadrcE We implemented 
a software on the onboard microcontroller able to con¬ 
trol the orientation of the robot by relying on the in¬ 
tegrated inertial measurement unit. The desired ori¬ 
entation is provided via a serial connection by a po¬ 
sition controller implemented within the ROS frame- 
worlj^that can run on any generic GNU-Linux machine. 
The machine can be either mounted onboard or acting 
as a base-station. In the latter case a wireless serial con¬ 
nection with XBee^is used. We opted for the separate 
base station in order to extend the flight time thanks 
to the reduction of the onboard weight. The current 
UAV position used by the controller is retrieved from a 
motion capturing systeirf^ while obstacles are defined 
statically before the task execution. 


4 Simulations and experiments 

In this section, we report the results of an extensive sim¬ 
ulative and experimental campaign meant to illustrate 
and validate the proposed method. The videos of the 
simulations and experiments can be watched in the at- 


tached material and on 

http;//homepage 

IS . laas . fr/ 

afranch!/videos/mult: 

i_exp_conn.html 



To abstract from simulations and experiments, we 
used the TeleKyb software framework, which is thor¬ 
oughly described in Grabe et al (20131. Finally, the de¬ 
sired trajectory (consisting of position, velocity and ac¬ 
celeration) is generated by our decentralized control al¬ 
gorithm implemented using SimulinlEl running in real¬ 
time at 1 kHz. 
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Fig. 6 : Snapshots of a simulation with 20 UAVs in empty space in three different consecutive time instants. The dotted black curves 
represent the planned path 7 ^ to the current target for each robot i (if it has a current target). Blue dots are the robots, the turquoise 
dot is the current ‘prime traveler’. Line segments represent the presence of a connection link between a pair of robots with the following 
color coding: green - well connected, red — close to disconnection. The robots are able to concurrently explore the given targets and 
continuously maintain the connectivity of the interaction graph. 




(a) 


(b) 


(c) 


Fig. 8 : Snapshots of a simulation in the office-like environment in three consecutive time instants. Graphical notation similar to 



4.1 Monte Carlo Simulations 

The proposed method has been extensively evaluated 
through randomized experiments in three significantly 

® http://www.geforce.com/hardware/technology/physx 
^ http://www.mikrokopter.com 
® http://www.ros.org 
® http://www.digi.com/lp/xbee 
http://www.vicon.com 

http://www.mathworks.com/products/simulink/ 


different scenarios. The first scenario is an obstacle free 
3D space and three snapshots of the evolution of the 
proposed algorithm are presented in Fig. The second, 
a more complex, scenario includes a part of a town and 
is reported in Fig. The third is an ofhce-like environ¬ 
ment shown in Fig. The size of the environments is 
50 m X 70 m for both the empty space and the town, 
and about 10 m x 15 m for the office. Since the first two 
environments are outdoor scenarios and the ofhce-like 





































































Decentralized Simultaneous Multi-target Exploration using a Connected Network of Multiple Robots 


15 


environment is indoor, two different sets of parameters 
were employed in the simulations. The values of the 
main parameters are listed in Table 

Table 2: Main parameters of the algorithm used in the 1800 ran¬ 
domized simulative trials in the different scenarios. 


parameter 

empty space and town 

office 

{K,Rs) 

(2.5m, 6m) 

( 1.1 m, 2.5m) 

(Ro,K) 

(0.75 m, 1.75 m) 

(0.25 m, 0.6 m) 

(RcK) 

(1 m, 2.5 m) 

(0.8m, 1.1m) 

(Ag'-,AS“>1) 

(0,1) 

(0,1) 

i?grid 

0.75 m 

0.25 m 

a 

3 

3 

.jjcruise 

3 m/s 

1 m/s for all i 

{Xc,Xm) 

(0.1,0.6)i)f“'=® 

(0.1,0.6)i;f'^‘=® 

At^ 

3 s for all i and k 

3s for all i and k 

Rz 

1 . 8 m 

Im 


The number of robots varied from 10 to 35. In every 
trial 3 targets are sequentially assigned to 5 robots and 
2 targets are sequentially assigned to other 5 robots, for 
a total of 25 targets per trial. The remaining robots are 
given no targets (i.e., they act always as ‘connectors’). 

The configuration of the given targets is randomized 
across the different trials. The same random configura¬ 
tions are repeated for every different number of robots 
in order to allow for a fair comparison among the re¬ 
sults. In the following we refer to the robots with at 
least one target assigned during a trial as ‘explorers’. 

To summarize, we simulated a total number of 1800 
trials arranged in the following way: in each of the 3 
scenes, and for each of the 100 target configurations in 
each scene, we ran a simulation with 6 different numbers 
of robots, namely 10, 15, 20, 25, 30, 35. We encourage 
the reader to also watch the attached video where some 
representative simulative trials are shown. 

In Fig. we show the evolutions of the statistical 
percentiles of: 

— the overall completion time, 

— the mean traveled distance of the 10 ‘explorers’, 

— the maximum Euclidean distance between two ‘ex¬ 
plorers’ 

— the average of A 2 (t) over time along the whole trial 
(we recall that the larger the A 2 the more connected 
is the group of robots, refer to Appendix [A| , 

when the number of robots varies from 10 (i.e., no 
‘connectors’) to 35 (i.e., 25 ‘connectors’). Each column 
refers to one of the 3 different scenarios. 

An improvement with the increasing number of ‘con¬ 
nectors’ in all scenarios is obvious. The mean comple¬ 
tion time (first row) roughly halves when comparing 
0 to 25 ‘connectors’. Adding more than 25 connectors 
will likely produce only minor improvement compared 


to the higher cost of having more robots, since the trend 
becomes basically flat. For this reason we did not per¬ 
form simulations with a larger number of robots. 

In the second row (mean traveled distance) one can 
see how, by already adding a few robots, a reduced 
mean traveled distance is obtained. This can be ex¬ 
plained by the fact that the ‘connectors’ make the ‘ex¬ 
plorers’ less disturbed by other ‘explorers’ with, there¬ 
fore, more freedom to avoid unnecessary detours in reach¬ 
ing their targets. 

Another measure of the reduced task completion 
time is the maximum stretch among the ‘explorers’ (i.e., 
the maximum Euclidean distance between any two ‘ex¬ 
plorers’, see third row). The more connectors, the more 
stretch is allowed: ‘connectors’ in fact provide the sup¬ 
port needed by the ‘explorers’ for keeping graph Q con¬ 
nected while freely moving towards their targets. Only 
the office-like environment does not show this trend in 
the maximum stretch. This is due to the fact that the 
scene is relatively small and therefore the targets are 
not enough spread apart, so no bigger stretch is needed. 

The increased freedom of the ‘explorers’ is also ev¬ 
ident in the plots of the average A 2 (t) (fourth row). 
These plots show how the ‘connectors’ are also useful 
to let the ‘explorers’ move more freely even in small 
environments. In fact, the larger the amount of ‘con¬ 
nectors’, the lower the mean A 2 : with more connectors 
the ‘explorers’ are more able to simultaneously travel 
towards their targets, thus bringing the topology of the 
group closer to less connected topologies (i.e., closer to 
tree-like topologies where the explorers would be the 
leaves of the tree). Clearly, this effect is independent 
of the maximum stretch, in fact the average A 2 follows 
this decreasing trend also in the third ofhce-like envi¬ 
ronment (third column). 


4.2 Experiments 


The experiments involved 6 real quadrotors and were 
meant to test the applicability of the algorithm in a 
real scenario. The parameters of the algorithm used in 
the experiments are reported in Table 

In order to obtain a trajectory smoother than qi (t) 
and, thus, better matching the dynamics capabilities of 
a quadrotor UAV ( Mistier et al||200T l, we made use of 
a fourth order linear filter for each quadrotor: 


= -ki'^'{it)-k2q{{t)-k3q{{t) + k4iqi{,t)-q{{t)) 

(16) 

that tracks the position of the original trajectory qi{t), 
while keeping the velocity, acceleration, and jerk low in 
the filtered trajectory. The tunable gains were chosen 
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Fig. 9: Statistics of the completion times (first row), mean traveled distance of the traveling robots (second row), the maximum 
Euclidian distance between two traveling robots (third row) and mean A 2 (forth row) versus the number of robots in the environments 
empty space (left column), town (middle column) and office (right column). 


as ki = 44, k 2 = 707, k^ = 5090, k^ = 13692 for placing 
the (real negative) poles at approximately —12, —13, 
— 14, —15, then resulting in a settling time of about 
0.3 s within a band of 5%. 


The average norm of all the quadrotors tracking errors 
during the whole experiment is 0.021m, a few short 
peaks are above 0.06 m, and the highest peak is about 
0.098m. 


The resulting trajectory q( (t) is then provided in 
place of qi{t) as input trajectory for the robot i as 
defined in (1), since it results very close to qi{t) as 
shown in Fig. |10a| However, at the same time, it pro¬ 
vides a much smoother reference position signal to the 
quadrotor by filtering off occasional abrupt motions, as 
can be seen in the velocity and acceleration reported 
in Figs. |10b| and [T0^ Figure [TT] shows the norms of the 
UAV errors while tracking the desired trajectory q{ (t). 


For these experiments, we reproduced a scene sim¬ 
ilar to the ofhce-like environment used in simulation, 
see Fig. 12 The UAVs with IDs ‘2’ and ‘4’ (called ‘ex¬ 
plorers’) were given some targets, while the UAVs with 
IDs ‘1’, ‘3’, ‘5’, and ‘6’ (‘connectors’) had no target, for 
then a total of 6 quadrotors. 


The ‘explorer’ 1 (with ID ‘4’) carries an onboard 
camera and has two targets in total. Whenever it reaches 
one of its targets it gives a human operator direct con- 
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a 


60 65 70 75 80 85 90 

time [s] 

(a) 



(b) 



(c) 

Fig. 10: Position, velocity and acceleration of ‘explorer’ 1 during 
a representative period of the experiment, where (jiit) 

are plotted in dash and qf (t), (t), (t) as solid curves. The x, 

y and z component is plotted in red, green and blue respectively. 


trol of the vehicle in the surrounding area of the target. 
Then, with the help of the onboard camera, the human 
operator has the task of searching for an object in the 
environment. When the object is found by the human 
operator, the task at the target is considered completed, 
and the UAV switches back to autonomous control. In 
order to allow full human control of ‘explorer’ 1 in the 
anchoring behavior, the UAV is temporarily decoupled 
from the point 94 , which is instead kept close to the tar¬ 
get by the action of /anchor (as desired). The ‘explorer’ 2 
(with ID ‘2’) is instead fully autonomous and is assigned 


Table 3: Main parameters used in the experiments. 


parameter 

value 

(R's^Rs) 

(1.4m, 2.5m) 

{Ro,K) 

(0.5 m, 0.75 m) 

{RcK) 

(1.0m, 1.4m) 


(0,1) 

Rgrid 

0.2 m 

a 

3 

.j-cruise 

0.5 m/s 

{xc,xm) 

(0.2, 0.7) wiruiae 

At>i 

3 s for all i and k 

Rz 

0.75 m 



40 60 80 

time [s] 


Fig. 11: Plots of the 6 norms of the position error between qiit) 
and the corresponding real quadrotor trajectory, for i = 1,..., 6. 
The average error norm is 0.021 m. 


with a total of 4 targets. At the first target location, the 
task is to pick up an object to then be released at the 
second target location. The same task is subsequently 
repeated with targets 3 and 4. We note, however, that 
the pick and place action is only virtually performed 
since the employed quadrotors are not equipped with 
an onboard gripper. We also stress that all these op¬ 
erations are performed concurrently while keeping the 
topology of the group connected at all times. 

A video of the experiment is present in the attached 
material and can be found under the given link above. 

Table [^reports and describes all the relevant events 
taking place during an experiment in a chronological 
order. 


Figure [ 12 ] shows the top-view of the ‘explorer’ paths 
for five representative time periods: Ti = [0,25] s in 
Fig. [T 2 a]r 2 = [25,60]s in Fig. pb| T 3 = [60,80]s 
in Fig. 12c T 4 = [80,120] s in Fig. 12d and finally 
T 5 = [120,129] s in Fig. 12e Every plot shows the (con¬ 
nected) graph topology of the group at the beginning 
of the time interval (dashed black lines) and the paths 
of the 2 ‘explorers’ (solid lines, blue for the ‘explorer’ 1 
and red for the ‘explorer’ 2). The initial positions of 
the robots are shown with colored circles and are la¬ 
beled with the IDs of the corresponding robots. The 
two small blue squares represent the two desired target 
locations of the ‘explorer’ 1. The two green squares and 
the two red squares represent the two pick positions and 
release positions of ‘explorer’ 2, respectively. Finally, 
the vertical walls of the environment are shown in gray. 
Figure |12f| on the other hand shows the z-coordinate 
of all the six quadrotors in order to understand the 3D 
motion in the 2D projections of Figs. |12a| to [l2el 

Figure[^shows three screenshots of the experiment: 
the lines between two quadrotors represent the corre¬ 
sponding connecting link as per graph Q. 


Finally, Fig. [^reports nine plots that capture the 
behavior of several quantities of interest throughout the 
whole experiment. As can be seen in Fig. |14a] the gen¬ 
eralized algebraic connectivity eigenvalue \ 2 {t) (see Ap- 
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Table 4: Chronological list of important events in the experiment. 


Fig. 12 

Time 

Events 


Os 

The experiment starts. Both ‘explor¬ 
ers’ are assigned a target and since ‘ex¬ 
plorer’ 2 is closer to its goal, it becomes 

Fig. 12a 


‘prime traveler’, while ‘explorer’ 1 is ‘sec¬ 
ondary traveler’. 


22 s 

‘Explorer’ 2 arrives at its first target, 
where it should pick up an object. There¬ 
fore ‘explorer’ 2 goes into ‘anchor’ and 
‘explorer’ 1 becomes ‘prime traveler’. 


29 s 

‘Explorer’ 2 has completed the pick-up 
action and receives the point to release 
the object as a new target. Since ‘ex¬ 
plorer’ 1 is still ‘prime traveler’, ‘ex- 

Fig. 12b 


plorer’ 2 becomes ‘secondary traveler’. 

35 s 

‘Explorer’ 1 arrives at its target, where 
the human operator takes control of the 
UAV and use its camera to find a yellow 
picture on the wall. ‘Explorer’ 2 then be¬ 
comes ‘prime traveler’. 


56 s 

‘Explorer’ 2 arrives at the target where 
it needs to release the object. 


63 s 

‘Explorer’ 2 has completed the releasing 
action and receives the next pick-up lo¬ 
cation. ‘Explorer’ 1 is still under the con¬ 
trol of the human operator and therefore 
in an ‘anchor’ state, so ‘explorer’ 2 di- 

Fig. 12c 


rectly becomes ‘prime traveler’. 

65 s 

The human operator finds the picture 
on the wall, ‘explorer’ 1 becomes au¬ 
tonomous again and starts to move to¬ 
wards its next target as ‘secondary trav¬ 
eler’, since ‘explorer’ 2 is ‘prime traveler’. 


78 s 

‘Explorer’ 2 arrives at the location where 
to pick up the second object and goes 
to the ‘anchor’ state. Hence ‘explorer’ 1 
becomes ‘prime traveler’. 


85 s 

‘Explorer’ 2 has completed the pick-up 
action and starts moving towards the re¬ 
leasing location as ‘secondary traveler’. 

Fig. 12d 

100 s 

‘explorer’ 1 arrives at its target, goes 


to ‘anchor’ state and is thus under con¬ 
trol of the human operator, therefore ‘ex¬ 
plorer’ 2 becomes ‘prime traveler’. 


119s 

‘Explorer’ 2 arrives at its final target and 
switches into ‘anchor’. 


123 s 

The human operator finds the searched 
object and ‘explorer’ 1 becomes ‘connec¬ 
tor’ since it has no new target location. 

Fig. 12e 

126 s 

‘Explorer’ 2 has completed the releasing 


action and becomes a ‘connector’ since 
it has also no new target. 


129 s 

No UAV has a next target and the ex¬ 
periment ends. 



Fig. 12: (a))(e) Top view of the 3D paths of the ‘explorers’ (solid 
blue and red curves) during the experiment in five representative 
time intervals. The interaction graph at the beginning of each 
interval is shown with black dashed lines. The ID of each robot 
is shown besides the circle representing the starting position of 
each robot at the beginning of the corresponding interval. Tar¬ 
gets are represented with colored squares and walls are gray. The 
specific time intervals are: |(a)| Ti = [0, 25] s, |(b)| T 2 = [25,60]s, 
Ts = [60,80 ]s,[^T 4 ^0,120] sand [(eJlTl = [120,129] s. 
2 -coordinate of the positions of all six quadrotors to help in- 


(c) 


(f) 


terpreting the 2D projection reported in the plots (and videos). 
The large vertical motion of ‘explorer’ 1 (blue) is due to the hu¬ 
man operator flying this robot, while the subsequent descent is 
autonomously performed thanks to the proposed algorithm. 


pendix]^ remains positive for any t > 0, thus implying 
continuous connectivity of the graph Q as desired. The 
time-varying number of edges in Fig. |14b| shows the 
dynamic reconfiguration of the group topology which 
ranges between topologies with 5 edges (the minimum 
for having Q connected) and topologies with up to 10 


edges. This plot clearly shows how the adopted con¬ 
nectivity maintenance approach can cope with time- 
varying graphs. In Fig. |14c[ we report the stretch of 
the group, defined as the maximum Euclidean distance 
between any two robots at a given time t. One can 
then appreciate how this stretch varies among 3.5 and 
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(a) 


(b) 


shows the side view of the scene from a fixed 
shows the view taken from the onboard camera 


Fig. 13: Three simultaneous screenshots of the experiment described in the text: 
camera. Connections between UAVs (brightened areas) are overlayed as green lines, 
of the ‘explorer’ 1 using the same highlighting. |(c)| shows a 3D synthetic reconstruction of the robot positions and connections are 
shown with a line given in green when the weight is high, red shortly before a connection breaks and as a gradient in between. The 
robot that is marked with the red sphere is currently decoupled and controlled by the human operator. 




(b) 



(e) 



(h) 



(c) 



(f) 



(i) 


Fig. 14: Behavior of different measurements during an experiment: (a) A 2 always keeps greater than zero, thus showing how the group 


remains connected at all times, [(^ the number of links |£’(i)| of the interaction graph ^(t), [(^ the stretch of the formation given by the 
maximum Euclidean distance between any two quadrotors over time, [(d)] the exploration states with the following meaning: 1: ‘prime 
traveler’, 2: ‘secondary traveler’, 3: ‘anchor’, 4; ‘connector’, |(e)| the position difference between the virtual point of the connectivity 
maintenance and the commanded position to the quadrotors showing the decoupling as an ‘anchor’, |(f)| the traveling efficiency of the 
current ‘prime traveler’ (see ( |12[ |), [(^ the estimation of the traveling efficiency by the ‘secondary travelers’ (see l |13[ l), [(h)| the force 
direction alignment for the ‘secondary travelers’ (see ( |10[ |)) [(i)| the adaptive gain used by the ‘secondary travelers’ to scale down their 
traveling force (see O). 


7.5 meters thus exploiting at most the allowable ranges 
of the experimental arena. Notice also how the stretch 
is in general larger when the number of links (and con¬ 
sequently A 2 (t)) is smaller. In fact the two peaks at 
about 60s and 103s occur when the robots are forced 
into a sparsely connected topology because the two ‘ex¬ 
plorers’ have concurrently reached their farthest target 
pairs, i.e., {Ai,B 2 ) and {Bi,D 2 ). 


Figure |14d| shows the ‘explorer’ states state 2 and 
state 4 over time, with a dashed blue line and solid red 
line, respectively. In the plot, the following code is used: 
1 = ‘prime traveler’, 2 = ‘secondary traveler’, 3 = 
‘anchor’ and 4 = ‘connector’. For i = 1,3,5,6 it is 
states = 4 for all t G [0,129]. Notice that, because of the 
algorithm design, at most one ‘explorer’ has states = 1 
at any given time. 
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The temporary decoupling of the ‘explorers’ from 
the points q 2 and during their anchoring behavior 
can be appreciated in Fig. |14e| where the Euclidian dis¬ 
tance between the real robot position in the trajectory 
and the corresponding qi{t) is shown, for i = 2,4. ‘Ex¬ 
plorer’ 1 (solid red line) decouples four times in total, 
in correspondence of the 2 pick-and-place operations, 
which gives rise to 4 short peaks in the plot. ‘Explorer’ 2 
(dashed blue line) decouples two times in total, in corre¬ 
spondence of the 2 human-in-the-loop operations, caus¬ 
ing 2 long peaks in the plot. 


Figure 14f shows the traveling efficiency Ap of the 
current ‘prime traveler’ with a dashed blue line when 
‘explorer’ 1 is the ‘prime traveler’ and with a solid red 
line when ‘explorer’ 2 is the ‘prime traveler’. The es¬ 
timation Ap of this value (see @) by all robots that 
are currently not ‘prime traveler’ is given in Fig. |15| 
We chose fcyi = 1 resulting in a relatively slow propa¬ 
gation to show the additional robustness of our algo¬ 
rithm against this parameter (and the simple adopted 
consensus propagation), but clearly one could easily 
employ higher gains. To make it easier for the reader 
to understand the following discussion, we show again 
in Fig. |14g| the essential information of this last plot 
whenever a robot is a ‘secondary traveler’. In Figs. |14h| 
and 14i the force direction alignment 0i (see @) and 
the adaptive gain pi (see of the current ‘secondary 
traveler’ are shown. In the latter three plots a dashed 
blue line indicates when ‘explorer’ 1 is the ‘secondary 
traveler’, and a solid red line when ‘explorer’ 2 is the 
‘secondary traveler’. 


To fully understand the important features of our 
method, we now give a detailed description of the time 
interval [0, 22] in the Figs. 14f to 14i A similar pattern 
can then be found in the rest of the experiment. In this 
time interval, the ‘explorer’ 2 is the ‘prime traveler’, 
while ‘explorer’ 1 is a ‘secondary traveler’ (and the rest 
are ‘connectors’). Due to the initial transient of its mo¬ 
tion controller, the ‘prime traveler’ starts with Ap = 0 
and quickly reaches Ap = 0.6. Shortly after, the travel¬ 
ing efficiency decreases again since ‘explorer’ 2 reaches 
the end of the area where it can freely move and, thus, 
needs to ‘pull’ the other robots for preserving connec¬ 
tivity of Q. This effect is propagated to the ‘explorer’ 1 
as shown in Fig. |14g[ The force direction alignment be¬ 
tween the traveling force and the generalized connectiv¬ 
ity force is shown in Fig. |14h| Combining these two plots 
with (151 allows to understand the effect of Fig. 14i As 
can be seen, the ‘secondary traveler’ slows down its mo¬ 
tion to around 10% for roughly 5 seconds. This enables 
the ‘prime traveler’ to travel faster again (see Fig. 14fl. 
However, since the ‘explorer’ 2 needs to move around 
the wall (see Fig. 12a I to reach its target, it needs to 



Fig. 15: Estimation of the ‘prime traveler’ traveling efficiency of 
all the six robots, whenever they are currently not a ‘prime trav¬ 
eler’ (see ( |13^ ). The color scheme for the robots is as in Fig. |12| 


‘pull’ the other robots even more for preserving connec¬ 
tivity. Therefore, the traveling efficiency becomes zero 
and, although the direction alignment of the ‘secondary 
traveler’ becomes higher, the overall gain pi stays very 
low: this makes it possible for the ‘prime traveler’ to 
eventually reach its target. We recall here that, accord¬ 
ing to Table and ([^, Ap = 1 as soon as the ‘prime 
traveler’ achieves a speed of at least 80% of its desired 
cruise speed (so the error is less than 20%), while Ap = 0 
means a speed of less than 30% (an error of more than 
70%), and not necessarily a zero velocity. 


5 Conclusions 

In this paper we presented a novel distributed and de¬ 
centralized control strategy that enables simultaneous 
multi-target exploration while ensuring a time-varying 
connected topology in a 3D cluttered environment. We 
provided a detailed description of our algorithm which 
effectively exploits presence of four dynamic roles for 
the robots in the group. In particular, a ‘connector’ 
is a robot with no active target, an ‘anchor’ a robot 
close to its desired location, and all other robots are in¬ 
stead moving towards their targets. Presence of at most 
one ‘prime traveler’, holding a leader virtue, is always 
guaranteed. All other robots (‘secondary travelers’) are 
bound to adapt their motion plan so as to facilitate the 
‘prime traveler’ visiting task. This feature ensures that 
the ‘prime traveler’ is always able to reach its target, 
and thus ultimately allows to conclude completeness of 
the exploration strategy. The scalability and effective¬ 
ness of the proposed method was shown by presenting 
a complete and extensive set of simulative results, as 
well as an experimental validation with real robots for 
further demonstrating the practical feasibility of our 
approach. 

As future development, we plan to modify the con¬ 
trol of the ‘connectors’ in order to actively improve the 
connectivity (e.g., moving towards the center of the 
group or towards the closest ‘explorer’) and therefore 
decrease the overall completion time even more. An- 
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other extension could include imposing temporal tar¬ 
gets that expire before any robot can possibly reach 
them. In our framework this could be easily achieved by 
letting the corresponding ‘prime traveler’ or ‘secondary 
traveler’ switching into a ‘connector’ whenever a target 
expires, for then automatically starting to explore the 
next target (if any). 

An important direction worth of investigation would 
also be the possibility to (explicitly) deal with errors 
or uncertainties in the relative position measurements 
(w.r.t. robots and obstacles) needed by the algorithm. 
Indeed, the presented results rely on an accurate mea¬ 
surement of robot and obstacle relative positions ob¬ 
tained by means of an external motion capture system. 

Another improvement could address the distributed 
election of the ‘prime traveler’ as was already discussed 
in Sec. |3.3| Indeed, while the adopted flooding approach 
does not require presence of a centralized planning unit, 
it still needs to take into account information from all 
robots. It would obviously be preferable to only exploit 
information available to the robot itself and its 1-hop 
neighbors. This could be achieved by leveraging some 
(suitable variant of the) consensus algorithm as done 
for the decentralized propagation of the traveling effi¬ 
ciency of the current ‘prime traveler’. More generally, 
it might also be beneficial to improve the election of 
the ‘prime traveler’ by considering other criteria than 
the Euclidean distance w.r.t. a target which may not 
always result in an ‘optimal’ group motion (e.g., when 
obstacles, such as a wall, are present between the next 
‘prime traveler’ and the target). The election could for 
instance choose the robot with the highest chance of 
reducing even further the completion time, e.g., based 
on the current motion of the group or direction of the 
majority of current targets of all ‘secondary travelers’. 

Finally, it would be interesting to obtain an analyti¬ 
cal upper bound of the total exploration time for our ap¬ 
proach, although, in our opinion, deriving such a bound 
is unfortunately not so straightforward. Clearly, the 
considered multi-target exploration scenario has some 
analogies with the multiple traveling salesman prob¬ 
lem (?), where a certain number N of agents are asked 
to And a set of N shortest routes through a set of m 
cities and return back to the start. Nevertheless, an 
analysis based on the multiple traveling salesman prob¬ 
lem would not easily extend to our case because of the 
constraint of continuous connectivity maintenance. 


A Appendix 


For the sake of completeness and readablity, we will recap here 
the main features of the connectivity maintenance algorithm pre¬ 
sented in |Robuffo Giordano et'^ ( |2013l with some small changes 


in the variable names. We start by defining dij = \\qi — qj\\ as 
the distance between two robot positions qi and qj, and dijo = 
min^^[0^i],oGO ~ Qi) ~ ^11 closest distance from 

the line of sight between robot i and j to any obstacle. 

The main conceptual steps behind the computation of can 
be summarized as follows: 


1. Define an auxiliary weighted graph (f) = (V, VF), where 
VF is a symmetric nonnegative n x n matrix whose entries 
Wij represent the weight of the edge (i,j) and {i,j) G ^ 
Wij > 0 . 

2. Design every weight Wij as a smooth function of the robot 
positions qi , qj and of the obstacle points surrounding qi and 
qj , with the property that Wij 0 if and only if at least one 
of the following conditions is verified: 

(a) the maximum sensing range Rs is reached: dij > Rs, 

(b) the minimum desired distance to obstacles Ro is reached 
(where Ro < Rm)'- dijo < Ro', 

(c) the minimum desired inter-robot distance Rc is reached: 
dik ^ Rc for at least one k ^ i. 

3. Compute as the negative gradient of a potential func¬ 

tion F’^(A 2 ) that grows unbounded when A 2 Ag^^^ from 
above, where A 2 is the second smallest eigenvalue of the 
(symmetric and positive semi-definite) Laplacian matrix L — 
diag7_-i Wij) — W, and AJ*^^ is a non-negative param¬ 

eter. This eigenvalue A 2 is often also called Fiedler eigenvalue. 


It is known from graph theory that a graph is connected if and 
only if the Fiedler eigenvalue of its Laplacian is positive < |FiedIer| 


1973 I. If ^^(0) is connected, and in particular A2(0) > A^' 
then under the action of the value of A 2 (t) can never decrease 
below AJ^^^ and therefore Q^{t) always stays connected. 

From a formal point of view the anti-gradient of for the 
2 -th robot takes the form 


dV^{X2) 

dqi 


dV^ d\2 
d\2 dqi 


(17) 


Moreover, if the formal expression of and W are known then 
jnL can be analytically computed via the expression (|Yang et al| 
|2010| l, 


d\2 

dqi 


E 


dWi, 

dqi 


('^2, 




(18) 


where 1 ^ 2 ^ is the 2 -th component of the normalized eigenvector of 
L associated to A 2 . 

In order to have a fully decentralized computation of /A, the 
robots perform a distributed estimation of both A 2 (£) and U 2 ^ (t), 
for all 2 = 1,..., N, as shown in [Yang et al| l |2010| |. In |Robuffo| 
[Giordano et al| ( |2013[ l the authors finally prove the passivity (and 
then the stability) of the system w.r.t. the pair (fi, Vi) for all i = 
1 ,..., Y, as we ll as the possibility to compute the connectivity 
force /A in 0 in a completely decentralized way. 
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